Overview

Dataset statistics

Number of variables30
Number of observations3901
Missing cells50824
Missing cells (%)43.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory914.4 KiB
Average record size in memory240.0 B

Variable types

Numeric18
Categorical12

Alerts

Payment Method Requested Code has constant value "99.0" Constant
Company Name has a high cardinality: 1196 distinct values High cardinality
Address has a high cardinality: 1176 distinct values High cardinality
City has a high cardinality: 371 distinct values High cardinality
State has a high cardinality: 57 distinct values High cardinality
Nif has a high cardinality: 3901 distinct values High cardinality
Unnamed: 0 is highly correlated with Payment Method Granted and 3 other fieldsHigh correlation
Postal Code is highly correlated with StateHigh correlation
Credit Limit Requested is highly correlated with Payment Method Requested CodeHigh correlation
Credit Limit Granted is highly correlated with Commercial Risk Cover Protected and 7 other fieldsHigh correlation
Commercial Risk Group Code is highly correlated with Credit Limit Granted and 8 other fieldsHigh correlation
Classification Decision Code is highly correlated with Credit Limit Granted and 8 other fieldsHigh correlation
Request Entry Date is highly correlated with Unnamed: 0 and 7 other fieldsHigh correlation
Effective Date is highly correlated with Payment Method Granted and 2 other fieldsHigh correlation
ClientePadre is highly correlated with Unnamed: 0 and 3 other fieldsHigh correlation
IDCliente is highly correlated with Unnamed: 0 and 3 other fieldsHigh correlation
VENTASCLIENTE is highly correlated with NArticulos and 1 other fieldsHigh correlation
MBAvg is highly correlated with MBMinHigh correlation
MBMax is highly correlated with PrecMedArticuloHigh correlation
MBMin is highly correlated with MBAvgHigh correlation
NArticulos is highly correlated with VENTASCLIENTE and 1 other fieldsHigh correlation
NContratos is highly correlated with VENTASCLIENTE and 1 other fieldsHigh correlation
PrecMedArticulo is highly correlated with MBMaxHigh correlation
DifCPrevCReal is highly correlated with Payment Method Requested CodeHigh correlation
State is highly correlated with Postal CodeHigh correlation
Payment Method Requested Code is highly correlated with Payment Method Granted and 6 other fieldsHigh correlation
Commercial Risk Cover Protected is highly correlated with Credit Limit Granted and 7 other fieldsHigh correlation
Payment Method Granted is highly correlated with Unnamed: 0 and 12 other fieldsHigh correlation
Payment Terms Granted is highly correlated with Credit Limit Granted and 7 other fieldsHigh correlation
Status Code is highly correlated with Credit Limit Granted and 7 other fieldsHigh correlation
Status is highly correlated with Credit Limit Granted and 7 other fieldsHigh correlation
Classification Decision is highly correlated with Credit Limit Granted and 9 other fieldsHigh correlation
Company Name has 2705 (69.3%) missing values Missing
Address has 2722 (69.8%) missing values Missing
City has 2726 (69.9%) missing values Missing
State has 2714 (69.6%) missing values Missing
Postal Code has 2707 (69.4%) missing values Missing
Credit Limit Requested has 2705 (69.3%) missing values Missing
Payment Method Requested Code has 2705 (69.3%) missing values Missing
Credit Limit Granted has 2705 (69.3%) missing values Missing
Commercial Risk Cover Protected has 2705 (69.3%) missing values Missing
Commercial Risk Group Code has 2705 (69.3%) missing values Missing
Payment Method Granted has 2705 (69.3%) missing values Missing
Payment Terms Granted has 2705 (69.3%) missing values Missing
Status Code has 2705 (69.3%) missing values Missing
Status has 2705 (69.3%) missing values Missing
Classification Decision Code has 2705 (69.3%) missing values Missing
Classification Decision has 2839 (72.8%) missing values Missing
Request Entry Date has 2705 (69.3%) missing values Missing
Effective Date has 2705 (69.3%) missing values Missing
DifCPrevCReal has 1951 (50.0%) missing values Missing
MBMax is highly skewed (γ1 = -32.60607807) Skewed
DifCPrevCReal is highly skewed (γ1 = 41.11657782) Skewed
Unnamed: 0 is uniformly distributed Uniform
Company Name is uniformly distributed Uniform
Address is uniformly distributed Uniform
Nif is uniformly distributed Uniform
Unnamed: 0 has unique values Unique
Nif has unique values Unique
ClientePadre has unique values Unique
IDCliente has unique values Unique
Credit Limit Granted has 626 (16.0%) zeros Zeros
Commercial Risk Group Code has 676 (17.3%) zeros Zeros
Classification Decision Code has 134 (3.4%) zeros Zeros
VENTASCLIENTE has 52 (1.3%) zeros Zeros
MBMin has 170 (4.4%) zeros Zeros
PrecMedArticulo has 52 (1.3%) zeros Zeros
DifCPrevCReal has 231 (5.9%) zeros Zeros

Reproduction

Analysis started2022-10-30 09:47:37.407737
Analysis finished2022-10-30 09:48:47.630812
Duration1 minute and 10.22 seconds
Software versionpandas-profiling v3.4.0
Download configurationconfig.json

Variables

Unnamed: 0
Real number (ℝ≥0)

HIGH CORRELATION
UNIFORM
UNIQUE

Distinct3901
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1950
Minimum0
Maximum3900
Zeros1
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size30.6 KiB

Quantile statistics

Minimum0
5-th percentile195
Q1975
median1950
Q32925
95-th percentile3705
Maximum3900
Range3900
Interquartile range (IQR)1950

Descriptive statistics

Standard deviation1126.266028
Coefficient of variation (CV)0.5775723222
Kurtosis-1.2
Mean1950
Median Absolute Deviation (MAD)975
Skewness0
Sum7606950
Variance1268475.167
MonotonicityStrictly increasing
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
01
 
< 0.1%
26211
 
< 0.1%
25931
 
< 0.1%
25941
 
< 0.1%
25951
 
< 0.1%
25961
 
< 0.1%
25971
 
< 0.1%
25981
 
< 0.1%
25991
 
< 0.1%
26001
 
< 0.1%
Other values (3891)3891
99.7%
ValueCountFrequency (%)
01
< 0.1%
11
< 0.1%
21
< 0.1%
31
< 0.1%
41
< 0.1%
51
< 0.1%
61
< 0.1%
71
< 0.1%
81
< 0.1%
91
< 0.1%
ValueCountFrequency (%)
39001
< 0.1%
38991
< 0.1%
38981
< 0.1%
38971
< 0.1%
38961
< 0.1%
38951
< 0.1%
38941
< 0.1%
38931
< 0.1%
38921
< 0.1%
38911
< 0.1%

Company Name
Categorical

HIGH CARDINALITY
MISSING
UNIFORM

Distinct1196
Distinct (%)100.0%
Missing2705
Missing (%)69.3%
Memory size30.6 KiB
ARNELA DISEÃ O, S.L.
 
1
VILLAMAR CRESPO SL
 
1
CONSTRUCCIONES OREGA SL
 
1
DISTRIBUCIONES FROIZ SA
 
1
C. Y P. LOS OBELISCOS S.L.
 
1
Other values (1191)
1191 

Length

Max length100
Median length55
Mean length26.98076923
Min length4

Characters and Unicode

Total characters32269
Distinct characters54
Distinct categories9 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1196 ?
Unique (%)100.0%

Sample

1st rowINDUSTRIAS CANDIDO HERMIDA SL
2nd rowESTRUCTURAS DE HORMIGON Y CONSTRUCCIONES ALPA SL
3rd rowHIJOS JOSE LOSADA CANCELO SA
4th rowCAMUYDE S.L.
5th rowGABADI SL

Common Values

ValueCountFrequency (%)
ARNELA DISEÃ O, S.L.1
 
< 0.1%
VILLAMAR CRESPO SL1
 
< 0.1%
CONSTRUCCIONES OREGA SL1
 
< 0.1%
DISTRIBUCIONES FROIZ SA1
 
< 0.1%
C. Y P. LOS OBELISCOS S.L.1
 
< 0.1%
FACTUM INVERSIONES Y PROYECTOS SL.1
 
< 0.1%
HIJOS DE VALEIRAS Y ALONSO SOCIEDAD LIMITADA.1
 
< 0.1%
TECBETON CONSTRUCCIONES S.L.1
 
< 0.1%
RE-CORTA, DEMOLICION TECNICA, SL.1
 
< 0.1%
SERFONOR MEDIOAMBIENTE SOCIEDAD LIMITADA.1
 
< 0.1%
Other values (1186)1186
30.4%
(Missing)2705
69.3%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
sl532
 
11.2%
s.l262
 
5.5%
y205
 
4.3%
sa142
 
3.0%
de136
 
2.9%
sociedad115
 
2.4%
construcciones105
 
2.2%
limitada104
 
2.2%
servicios56
 
1.2%
obras51
 
1.1%
Other values (1881)3045
64.1%

Most occurring characters

ValueCountFrequency (%)
3567
11.1%
S3173
9.8%
A3132
 
9.7%
E2550
 
7.9%
I2372
 
7.4%
O2323
 
7.2%
L2009
 
6.2%
R1931
 
6.0%
C1746
 
5.4%
N1618
 
5.0%
Other values (44)7848
24.3%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter27316
84.7%
Space Separator3567
 
11.1%
Other Punctuation1222
 
3.8%
Decimal Number139
 
0.4%
Dash Punctuation15
 
< 0.1%
Modifier Symbol3
 
< 0.1%
Lowercase Letter3
 
< 0.1%
Open Punctuation2
 
< 0.1%
Close Punctuation2
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
S3173
11.6%
A3132
11.5%
E2550
9.3%
I2372
8.7%
O2323
8.5%
L2009
 
7.4%
R1931
 
7.1%
C1746
 
6.4%
N1618
 
5.9%
T1327
 
4.9%
Other values (20)5135
18.8%
Decimal Number
ValueCountFrequency (%)
036
25.9%
232
23.0%
123
16.5%
89
 
6.5%
59
 
6.5%
97
 
5.0%
67
 
5.0%
77
 
5.0%
45
 
3.6%
34
 
2.9%
Other Punctuation
ValueCountFrequency (%)
.1046
85.6%
,164
 
13.4%
&7
 
0.6%
?2
 
0.2%
*1
 
0.1%
/1
 
0.1%
#1
 
0.1%
Lowercase Letter
ValueCountFrequency (%)
ó2
66.7%
ñ1
33.3%
Space Separator
ValueCountFrequency (%)
3567
100.0%
Dash Punctuation
ValueCountFrequency (%)
-15
100.0%
Modifier Symbol
ValueCountFrequency (%)
`3
100.0%
Open Punctuation
ValueCountFrequency (%)
(2
100.0%
Close Punctuation
ValueCountFrequency (%)
)2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin27319
84.7%
Common4950
 
15.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
S3173
11.6%
A3132
11.5%
E2550
9.3%
I2372
8.7%
O2323
8.5%
L2009
 
7.4%
R1931
 
7.1%
C1746
 
6.4%
N1618
 
5.9%
T1327
 
4.9%
Other values (22)5138
18.8%
Common
ValueCountFrequency (%)
3567
72.1%
.1046
 
21.1%
,164
 
3.3%
036
 
0.7%
232
 
0.6%
123
 
0.5%
-15
 
0.3%
89
 
0.2%
59
 
0.2%
97
 
0.1%
Other values (12)42
 
0.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII32217
99.8%
None52
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3567
11.1%
S3173
9.8%
A3132
9.7%
E2550
 
7.9%
I2372
 
7.4%
O2323
 
7.2%
L2009
 
6.2%
R1931
 
6.0%
C1746
 
5.4%
N1618
 
5.0%
Other values (38)7796
24.2%
None
ValueCountFrequency (%)
Ñ28
53.8%
Ã18
34.6%
Â2
 
3.8%
ó2
 
3.8%
Ó1
 
1.9%
ñ1
 
1.9%

Address
Categorical

HIGH CARDINALITY
MISSING
UNIFORM

Distinct1176
Distinct (%)99.7%
Missing2722
Missing (%)69.8%
Memory size30.6 KiB
000
 
3
MUELLE COMERCIAL S/N, 0000
 
2
ALONSO OJEDA, 18 - BJ
 
1
ANURIÃ AS 14
 
1
EMILIA PARDO BAZAN 25
 
1
Other values (1171)
1171 

Length

Max length55
Median length42
Mean length23.98473282
Min length1

Characters and Unicode

Total characters28278
Distinct characters71
Distinct categories9 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1174 ?
Unique (%)99.6%

Sample

1st rowO CALVARIO S/N
2nd rowVILLA SOLEDAD 2
3rd rowPERBES 35
4th rowCTRA CEDEIRA KM ,
5th rowO ESPIÑO

Common Values

ValueCountFrequency (%)
0003
 
0.1%
MUELLE COMERCIAL S/N, 00002
 
0.1%
ALONSO OJEDA, 18 - BJ1
 
< 0.1%
ANURIÃ AS 141
 
< 0.1%
EMILIA PARDO BAZAN 251
 
< 0.1%
LOURIDO 151
 
< 0.1%
FUEROS DE LEON , 001
 
< 0.1%
PRINCIPE, 43 - PISO 1 C1
 
< 0.1%
CURROS ENRIQUEZ, 29 - BJ1
 
< 0.1%
CANTARRANA, 13 - ENT1
 
< 0.1%
Other values (1166)1166
29.9%
(Missing)2722
69.8%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
356
 
6.7%
de244
 
4.6%
1109
 
2.0%
piso79
 
1.5%
277
 
1.4%
372
 
1.3%
a71
 
1.3%
bj69
 
1.3%
s/n68
 
1.3%
la58
 
1.1%
Other values (1708)4131
77.4%

Most occurring characters

ValueCountFrequency (%)
5921
20.9%
A2845
 
10.1%
E1743
 
6.2%
O1646
 
5.8%
R1532
 
5.4%
I1274
 
4.5%
L1273
 
4.5%
N1151
 
4.1%
S1046
 
3.7%
D934
 
3.3%
Other values (61)8913
31.5%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter18415
65.1%
Space Separator5921
 
20.9%
Decimal Number2319
 
8.2%
Other Punctuation853
 
3.0%
Dash Punctuation357
 
1.3%
Open Punctuation151
 
0.5%
Close Punctuation149
 
0.5%
Other Letter74
 
0.3%
Lowercase Letter39
 
0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
A2845
15.4%
E1743
9.5%
O1646
 
8.9%
R1532
 
8.3%
I1274
 
6.9%
L1273
 
6.9%
N1151
 
6.3%
S1046
 
5.7%
D934
 
5.1%
C831
 
4.5%
Other values (23)4140
22.5%
Lowercase Letter
ValueCountFrequency (%)
o6
15.4%
a4
 
10.3%
e3
 
7.7%
c3
 
7.7%
r3
 
7.7%
í2
 
5.1%
m2
 
5.1%
ñ2
 
5.1%
d2
 
5.1%
i2
 
5.1%
Other values (7)10
25.6%
Decimal Number
ValueCountFrequency (%)
1478
20.6%
0385
16.6%
2343
14.8%
3275
11.9%
5186
 
8.0%
4164
 
7.1%
6163
 
7.0%
8124
 
5.3%
7121
 
5.2%
980
 
3.4%
Other Punctuation
ValueCountFrequency (%)
,535
62.7%
.179
 
21.0%
/134
 
15.7%
?5
 
0.6%
Open Punctuation
ValueCountFrequency (%)
(150
99.3%
{1
 
0.7%
Other Letter
ValueCountFrequency (%)
º71
95.9%
ª3
 
4.1%
Space Separator
ValueCountFrequency (%)
5921
100.0%
Dash Punctuation
ValueCountFrequency (%)
-357
100.0%
Close Punctuation
ValueCountFrequency (%)
)149
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin18528
65.5%
Common9750
34.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
A2845
15.4%
E1743
9.4%
O1646
 
8.9%
R1532
 
8.3%
I1274
 
6.9%
L1273
 
6.9%
N1151
 
6.2%
S1046
 
5.6%
D934
 
5.0%
C831
 
4.5%
Other values (42)4253
23.0%
Common
ValueCountFrequency (%)
5921
60.7%
,535
 
5.5%
1478
 
4.9%
0385
 
3.9%
-357
 
3.7%
2343
 
3.5%
3275
 
2.8%
5186
 
1.9%
.179
 
1.8%
4164
 
1.7%
Other values (9)927
 
9.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII28124
99.5%
None154
 
0.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5921
21.1%
A2845
 
10.1%
E1743
 
6.2%
O1646
 
5.9%
R1532
 
5.4%
I1274
 
4.5%
L1273
 
4.5%
N1151
 
4.1%
S1046
 
3.7%
D934
 
3.3%
Other values (47)8759
31.1%
None
ValueCountFrequency (%)
º71
46.1%
Ã31
20.1%
Ñ26
 
16.9%
Â6
 
3.9%
Ó5
 
3.2%
ª3
 
1.9%
í2
 
1.3%
ñ2
 
1.3%
é2
 
1.3%
á2
 
1.3%
Other values (4)4
 
2.6%

City
Categorical

HIGH CARDINALITY
MISSING

Distinct371
Distinct (%)31.6%
Missing2726
Missing (%)69.9%
Memory size30.6 KiB
PONFERRADA
 
85
FERROL
 
70
NARON
 
64
MADRID
 
57
A CORUÃ A
 
36
Other values (366)
863 

Length

Max length30
Median length26
Mean length8.77106383
Min length1

Characters and Unicode

Total characters10306
Distinct characters56
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique253 ?
Unique (%)21.5%

Sample

1st rowVALDOVIEO
2nd rowEL FERROL
3rd rowEL FERROL
4th rowNARON
5th rowFERROL

Common Values

ValueCountFrequency (%)
PONFERRADA85
 
2.2%
FERROL70
 
1.8%
NARON64
 
1.6%
MADRID57
 
1.5%
A CORUÃ A36
 
0.9%
VIGO28
 
0.7%
ARTEIXO23
 
0.6%
VIVEIRO23
 
0.6%
LUGO22
 
0.6%
LA CORUÑA20
 
0.5%
Other values (361)747
 
19.1%
(Missing)2726
69.9%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
a139
 
8.0%
ponferrada86
 
4.9%
de83
 
4.8%
ferrol81
 
4.6%
naron64
 
3.7%
madrid59
 
3.4%
coruña50
 
2.9%
la41
 
2.4%
coruã40
 
2.3%
vigo28
 
1.6%
Other values (409)1073
61.5%

Most occurring characters

ValueCountFrequency (%)
A1514
14.7%
R1169
11.3%
O1088
10.6%
E956
9.3%
L598
 
5.8%
N581
 
5.6%
577
 
5.6%
D543
 
5.3%
I499
 
4.8%
C395
 
3.8%
Other values (46)2386
23.2%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter9655
93.7%
Space Separator577
 
5.6%
Dash Punctuation19
 
0.2%
Lowercase Letter18
 
0.2%
Other Punctuation13
 
0.1%
Decimal Number13
 
0.1%
Open Punctuation6
 
0.1%
Close Punctuation5
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
A1514
15.7%
R1169
12.1%
O1088
11.3%
E956
9.9%
L598
 
6.2%
N581
 
6.0%
D543
 
5.6%
I499
 
5.2%
C395
 
4.1%
S350
 
3.6%
Other values (21)1962
20.3%
Lowercase Letter
ValueCountFrequency (%)
ñ6
33.3%
a3
16.7%
l2
 
11.1%
â1
 
5.6%
u1
 
5.6%
r1
 
5.6%
o1
 
5.6%
e1
 
5.6%
v1
 
5.6%
i1
 
5.6%
Decimal Number
ValueCountFrequency (%)
03
23.1%
53
23.1%
22
15.4%
12
15.4%
41
 
7.7%
71
 
7.7%
31
 
7.7%
Other Punctuation
ValueCountFrequency (%)
?5
38.5%
/4
30.8%
,2
 
15.4%
.2
 
15.4%
Space Separator
ValueCountFrequency (%)
577
100.0%
Dash Punctuation
ValueCountFrequency (%)
-19
100.0%
Open Punctuation
ValueCountFrequency (%)
(6
100.0%
Close Punctuation
ValueCountFrequency (%)
)5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin9673
93.9%
Common633
 
6.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
A1514
15.7%
R1169
12.1%
O1088
11.2%
E956
9.9%
L598
 
6.2%
N581
 
6.0%
D543
 
5.6%
I499
 
5.2%
C395
 
4.1%
S350
 
3.6%
Other values (31)1980
20.5%
Common
ValueCountFrequency (%)
577
91.2%
-19
 
3.0%
(6
 
0.9%
?5
 
0.8%
)5
 
0.8%
/4
 
0.6%
03
 
0.5%
53
 
0.5%
22
 
0.3%
12
 
0.3%
Other values (5)7
 
1.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII10172
98.7%
None134
 
1.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
A1514
14.9%
R1169
11.5%
O1088
10.7%
E956
9.4%
L598
 
5.9%
N581
 
5.7%
577
 
5.7%
D543
 
5.3%
I499
 
4.9%
C395
 
3.9%
Other values (38)2252
22.1%
None
ValueCountFrequency (%)
Ã60
44.8%
Ñ59
44.0%
ñ6
 
4.5%
Â4
 
3.0%
Ü2
 
1.5%
â1
 
0.7%
Ó1
 
0.7%
È1
 
0.7%

State
Categorical

HIGH CARDINALITY
HIGH CORRELATION
MISSING

Distinct57
Distinct (%)4.8%
Missing2714
Missing (%)69.6%
Memory size30.6 KiB
LA CORUñA
435 
LEON
180 
LUGO
121 
MADRID
89 
PONTEVEDRA
76 
Other values (52)
286 

Length

Max length12
Median length11
Mean length7.291491154
Min length4

Characters and Unicode

Total characters8655
Distinct characters40
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique21 ?
Unique (%)1.8%

Sample

1st rowLA CORUÑA
2nd rowLA CORU A
3rd rowLA CORU A
4th rowLA CORUñA
5th rowLA CORUÑA

Common Values

ValueCountFrequency (%)
LA CORUñA435
 
11.2%
LEON180
 
4.6%
LUGO121
 
3.1%
MADRID89
 
2.3%
PONTEVEDRA76
 
1.9%
LA CORUÑA57
 
1.5%
ORENSE39
 
1.0%
ASTURIAS27
 
0.7%
LA CORU A18
 
0.5%
BARCELONA17
 
0.4%
Other values (47)128
 
3.3%
(Missing)2714
69.6%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
la515
29.7%
coruña505
29.1%
leon180
 
10.4%
lugo122
 
7.0%
madrid91
 
5.3%
pontevedra76
 
4.4%
orense39
 
2.3%
asturias28
 
1.6%
a27
 
1.6%
coru18
 
1.0%
Other values (41)132
 
7.6%

Most occurring characters

ValueCountFrequency (%)
A1498
17.3%
O991
11.5%
L904
10.4%
R824
9.5%
U695
8.0%
C588
 
6.8%
547
 
6.3%
E467
 
5.4%
ñ440
 
5.1%
N345
 
4.0%
Other values (30)1356
15.7%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter7624
88.1%
Space Separator547
 
6.3%
Lowercase Letter484
 
5.6%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
A1498
19.6%
O991
13.0%
L904
11.9%
R824
10.8%
U695
9.1%
C588
 
7.7%
E467
 
6.1%
N345
 
4.5%
D283
 
3.7%
I188
 
2.5%
Other values (14)841
11.0%
Lowercase Letter
ValueCountFrequency (%)
ñ440
90.9%
a10
 
2.1%
r6
 
1.2%
u5
 
1.0%
d4
 
0.8%
i4
 
0.8%
o4
 
0.8%
e2
 
0.4%
s2
 
0.4%
l2
 
0.4%
Other values (5)5
 
1.0%
Space Separator
ValueCountFrequency (%)
547
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin8108
93.7%
Common547
 
6.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
A1498
18.5%
O991
12.2%
L904
11.1%
R824
10.2%
U695
8.6%
C588
 
7.3%
E467
 
5.8%
ñ440
 
5.4%
N345
 
4.3%
D283
 
3.5%
Other values (29)1073
13.2%
Common
ValueCountFrequency (%)
547
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII8147
94.1%
None508
 
5.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
A1498
18.4%
O991
12.2%
L904
11.1%
R824
10.1%
U695
8.5%
C588
 
7.2%
547
 
6.7%
E467
 
5.7%
N345
 
4.2%
D283
 
3.5%
Other values (26)1005
12.3%
None
ValueCountFrequency (%)
ñ440
86.6%
Ñ66
 
13.0%
ó1
 
0.2%
Ó1
 
0.2%

Postal Code
Real number (ℝ≥0)

HIGH CORRELATION
MISSING

Distinct468
Distinct (%)39.2%
Missing2707
Missing (%)69.4%
Infinite0
Infinite (%)0.0%
Mean22777.01508
Minimum2400
Maximum50197
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size30.6 KiB

Quantile statistics

Minimum2400
5-th percentile15000
Q115320
median24193
Q328009
95-th percentile36810.5
Maximum50197
Range47797
Interquartile range (IQR)12689

Descriptive statistics

Standard deviation8900.51262
Coefficient of variation (CV)0.3907672972
Kurtosis0.1397390441
Mean22777.01508
Median Absolute Deviation (MAD)8623
Skewness0.7268860837
Sum27195756
Variance79219124.91
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2440044
 
1.1%
1557035
 
0.9%
1500034
 
0.9%
2785029
 
0.7%
1519024
 
0.6%
1500821
 
0.5%
1532018
 
0.5%
1514216
 
0.4%
2440216
 
0.4%
1540515
 
0.4%
Other values (458)942
 
24.1%
(Missing)2707
69.4%
ValueCountFrequency (%)
24001
< 0.1%
24401
< 0.1%
32951
< 0.1%
33211
< 0.1%
36001
< 0.1%
36901
< 0.1%
38041
< 0.1%
42401
< 0.1%
46102
0.1%
70031
< 0.1%
ValueCountFrequency (%)
501971
< 0.1%
500031
< 0.1%
496961
< 0.1%
489801
< 0.1%
489701
< 0.1%
489602
0.1%
489501
< 0.1%
487001
< 0.1%
485301
< 0.1%
481701
< 0.1%

Credit Limit Requested
Real number (ℝ≥0)

HIGH CORRELATION
MISSING

Distinct6
Distinct (%)0.5%
Missing2705
Missing (%)69.3%
Infinite0
Infinite (%)0.0%
Mean44928.92977
Minimum10000
Maximum70000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size30.6 KiB

Quantile statistics

Minimum10000
5-th percentile45000
Q145000
median45000
Q345000
95-th percentile45000
Maximum70000
Range60000
Interquartile range (IQR)0

Descriptive statistics

Standard deviation1775.933729
Coefficient of variation (CV)0.03952762148
Kurtosis270.0713326
Mean44928.92977
Median Absolute Deviation (MAD)0
Skewness-11.53372543
Sum53735000
Variance3153940.611
MonotonicityNot monotonic
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
450001191
30.5%
700001
 
< 0.1%
200001
 
< 0.1%
250001
 
< 0.1%
150001
 
< 0.1%
100001
 
< 0.1%
(Missing)2705
69.3%
ValueCountFrequency (%)
100001
 
< 0.1%
150001
 
< 0.1%
200001
 
< 0.1%
250001
 
< 0.1%
450001191
30.5%
700001
 
< 0.1%
ValueCountFrequency (%)
700001
 
< 0.1%
450001191
30.5%
250001
 
< 0.1%
200001
 
< 0.1%
150001
 
< 0.1%
100001
 
< 0.1%

Payment Method Requested Code
Categorical

CONSTANT
HIGH CORRELATION
MISSING
REJECTED

Distinct1
Distinct (%)0.1%
Missing2705
Missing (%)69.3%
Memory size30.6 KiB
99.0
1196 

Length

Max length4
Median length4
Mean length4
Min length4

Characters and Unicode

Total characters4784
Distinct characters3
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row99.0
2nd row99.0
3rd row99.0
4th row99.0
5th row99.0

Common Values

ValueCountFrequency (%)
99.01196
30.7%
(Missing)2705
69.3%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
99.01196
100.0%

Most occurring characters

ValueCountFrequency (%)
92392
50.0%
.1196
25.0%
01196
25.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number3588
75.0%
Other Punctuation1196
 
25.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
92392
66.7%
01196
33.3%
Other Punctuation
ValueCountFrequency (%)
.1196
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common4784
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
92392
50.0%
.1196
25.0%
01196
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII4784
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
92392
50.0%
.1196
25.0%
01196
25.0%

Credit Limit Granted
Real number (ℝ≥0)

HIGH CORRELATION
MISSING
ZEROS

Distinct17
Distinct (%)1.4%
Missing2705
Missing (%)69.3%
Infinite0
Infinite (%)0.0%
Mean16740.80268
Minimum0
Maximum45000
Zeros626
Zeros (%)16.0%
Negative0
Negative (%)0.0%
Memory size30.6 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q345000
95-th percentile45000
Maximum45000
Range45000
Interquartile range (IQR)45000

Descriptive statistics

Standard deviation20156.11972
Coefficient of variation (CV)1.204011547
Kurtosis-1.541095265
Mean16740.80268
Median Absolute Deviation (MAD)0
Skewness0.5619382697
Sum20022000
Variance406269162.2
MonotonicityNot monotonic
Histogram with fixed size bins (bins=17)
ValueCountFrequency (%)
0626
 
16.0%
45000370
 
9.5%
1000057
 
1.5%
2000037
 
0.9%
2500019
 
0.5%
1500017
 
0.4%
500014
 
0.4%
800010
 
0.3%
280007
 
0.2%
300006
 
0.2%
Other values (7)33
 
0.8%
(Missing)2705
69.3%
ValueCountFrequency (%)
0626
16.0%
30002
 
0.1%
500014
 
0.4%
800010
 
0.3%
1000057
 
1.5%
120005
 
0.1%
1500017
 
0.4%
180006
 
0.2%
2000037
 
0.9%
220005
 
0.1%
ValueCountFrequency (%)
45000370
9.5%
380004
 
0.1%
350006
 
0.2%
320005
 
0.1%
300006
 
0.2%
280007
 
0.2%
2500019
 
0.5%
220005
 
0.1%
2000037
 
0.9%
180006
 
0.2%

Commercial Risk Cover Protected
Categorical

HIGH CORRELATION
MISSING

Distinct3
Distinct (%)0.3%
Missing2705
Missing (%)69.3%
Memory size30.6 KiB
0.0
626 
95.0
561 
50.0
 
9

Length

Max length4
Median length3
Mean length3.476588629
Min length3

Characters and Unicode

Total characters4158
Distinct characters4
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0.0
2nd row0.0
3rd row95.0
4th row0.0
5th row95.0

Common Values

ValueCountFrequency (%)
0.0626
 
16.0%
95.0561
 
14.4%
50.09
 
0.2%
(Missing)2705
69.3%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
0.0626
52.3%
95.0561
46.9%
50.09
 
0.8%

Most occurring characters

ValueCountFrequency (%)
01831
44.0%
.1196
28.8%
5570
 
13.7%
9561
 
13.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number2962
71.2%
Other Punctuation1196
28.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01831
61.8%
5570
 
19.2%
9561
 
18.9%
Other Punctuation
ValueCountFrequency (%)
.1196
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common4158
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01831
44.0%
.1196
28.8%
5570
 
13.7%
9561
 
13.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII4158
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01831
44.0%
.1196
28.8%
5570
 
13.7%
9561
 
13.5%

Commercial Risk Group Code
Real number (ℝ≥0)

HIGH CORRELATION
MISSING
ZEROS

Distinct8
Distinct (%)0.7%
Missing2705
Missing (%)69.3%
Infinite0
Infinite (%)0.0%
Mean1.689799331
Minimum0
Maximum7
Zeros676
Zeros (%)17.3%
Negative0
Negative (%)0.0%
Memory size30.6 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q34
95-th percentile6
Maximum7
Range7
Interquartile range (IQR)4

Descriptive statistics

Standard deviation2.207616248
Coefficient of variation (CV)1.306436929
Kurtosis-0.8857181646
Mean1.689799331
Median Absolute Deviation (MAD)0
Skewness0.8369343879
Sum2021
Variance4.873569499
MonotonicityNot monotonic
Histogram with fixed size bins (bins=8)
ValueCountFrequency (%)
0676
 
17.3%
5162
 
4.2%
491
 
2.3%
272
 
1.8%
368
 
1.7%
156
 
1.4%
654
 
1.4%
717
 
0.4%
(Missing)2705
69.3%
ValueCountFrequency (%)
0676
17.3%
156
 
1.4%
272
 
1.8%
368
 
1.7%
491
 
2.3%
5162
 
4.2%
654
 
1.4%
717
 
0.4%
ValueCountFrequency (%)
717
 
0.4%
654
 
1.4%
5162
 
4.2%
491
 
2.3%
368
 
1.7%
272
 
1.8%
156
 
1.4%
0676
17.3%

Payment Method Granted
Categorical

HIGH CORRELATION
MISSING

Distinct2
Distinct (%)0.2%
Missing2705
Missing (%)69.3%
Memory size30.6 KiB
0.0
626 
99.0
570 

Length

Max length4
Median length3
Mean length3.476588629
Min length3

Characters and Unicode

Total characters4158
Distinct characters3
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0.0
2nd row0.0
3rd row99.0
4th row0.0
5th row99.0

Common Values

ValueCountFrequency (%)
0.0626
 
16.0%
99.0570
 
14.6%
(Missing)2705
69.3%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
0.0626
52.3%
99.0570
47.7%

Most occurring characters

ValueCountFrequency (%)
01822
43.8%
.1196
28.8%
91140
27.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number2962
71.2%
Other Punctuation1196
28.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01822
61.5%
91140
38.5%
Other Punctuation
ValueCountFrequency (%)
.1196
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common4158
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
01822
43.8%
.1196
28.8%
91140
27.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII4158
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
01822
43.8%
.1196
28.8%
91140
27.4%

Payment Terms Granted
Categorical

HIGH CORRELATION
MISSING

Distinct3
Distinct (%)0.3%
Missing2705
Missing (%)69.3%
Memory size30.6 KiB
0.0
626 
180.0
569 
150.0
 
1

Length

Max length5
Median length3
Mean length3.953177258
Min length3

Characters and Unicode

Total characters4728
Distinct characters5
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st row0.0
2nd row0.0
3rd row180.0
4th row0.0
5th row180.0

Common Values

ValueCountFrequency (%)
0.0626
 
16.0%
180.0569
 
14.6%
150.01
 
< 0.1%
(Missing)2705
69.3%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
0.0626
52.3%
180.0569
47.6%
150.01
 
0.1%

Most occurring characters

ValueCountFrequency (%)
02392
50.6%
.1196
25.3%
1570
 
12.1%
8569
 
12.0%
51
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number3532
74.7%
Other Punctuation1196
 
25.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
02392
67.7%
1570
 
16.1%
8569
 
16.1%
51
 
< 0.1%
Other Punctuation
ValueCountFrequency (%)
.1196
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common4728
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
02392
50.6%
.1196
25.3%
1570
 
12.1%
8569
 
12.0%
51
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII4728
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
02392
50.6%
.1196
25.3%
1570
 
12.1%
8569
 
12.0%
51
 
< 0.1%

Status Code
Categorical

HIGH CORRELATION
MISSING

Distinct3
Distinct (%)0.3%
Missing2705
Missing (%)69.3%
Memory size30.6 KiB
2.0
626 
66.0
564 
8.0
 
6

Length

Max length4
Median length3
Mean length3.471571906
Min length3

Characters and Unicode

Total characters4152
Distinct characters5
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2.0
2nd row2.0
3rd row66.0
4th row2.0
5th row66.0

Common Values

ValueCountFrequency (%)
2.0626
 
16.0%
66.0564
 
14.5%
8.06
 
0.2%
(Missing)2705
69.3%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
2.0626
52.3%
66.0564
47.2%
8.06
 
0.5%

Most occurring characters

ValueCountFrequency (%)
.1196
28.8%
01196
28.8%
61128
27.2%
2626
15.1%
86
 
0.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number2956
71.2%
Other Punctuation1196
28.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
01196
40.5%
61128
38.2%
2626
21.2%
86
 
0.2%
Other Punctuation
ValueCountFrequency (%)
.1196
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common4152
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
.1196
28.8%
01196
28.8%
61128
27.2%
2626
15.1%
86
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII4152
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
.1196
28.8%
01196
28.8%
61128
27.2%
2626
15.1%
86
 
0.1%

Status
Categorical

HIGH CORRELATION
MISSING

Distinct3
Distinct (%)0.3%
Missing2705
Missing (%)69.3%
Memory size30.6 KiB
ANULADA
626 
EN VIGOR
564 
ANULADA-MANTENIMIENTO
 
6

Length

Max length21
Median length7
Mean length7.54180602
Min length7

Characters and Unicode

Total characters9020
Distinct characters15
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowANULADA
2nd rowANULADA
3rd rowEN VIGOR
4th rowANULADA
5th rowEN VIGOR

Common Values

ValueCountFrequency (%)
ANULADA626
 
16.0%
EN VIGOR564
 
14.5%
ANULADA-MANTENIMIENTO6
 
0.2%
(Missing)2705
69.3%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
anulada626
35.6%
en564
32.0%
vigor564
32.0%
anulada-mantenimiento6
 
0.3%

Most occurring characters

ValueCountFrequency (%)
A1902
21.1%
N1214
13.5%
U632
 
7.0%
L632
 
7.0%
D632
 
7.0%
E576
 
6.4%
I576
 
6.4%
O570
 
6.3%
564
 
6.3%
V564
 
6.3%
Other values (5)1158
12.8%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter8450
93.7%
Space Separator564
 
6.3%
Dash Punctuation6
 
0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
A1902
22.5%
N1214
14.4%
U632
 
7.5%
L632
 
7.5%
D632
 
7.5%
E576
 
6.8%
I576
 
6.8%
O570
 
6.7%
V564
 
6.7%
G564
 
6.7%
Other values (3)588
 
7.0%
Space Separator
ValueCountFrequency (%)
564
100.0%
Dash Punctuation
ValueCountFrequency (%)
-6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin8450
93.7%
Common570
 
6.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
A1902
22.5%
N1214
14.4%
U632
 
7.5%
L632
 
7.5%
D632
 
7.5%
E576
 
6.8%
I576
 
6.8%
O570
 
6.7%
V564
 
6.7%
G564
 
6.7%
Other values (3)588
 
7.0%
Common
ValueCountFrequency (%)
564
98.9%
-6
 
1.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII9020
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
A1902
21.1%
N1214
13.5%
U632
 
7.0%
L632
 
7.0%
D632
 
7.0%
E576
 
6.4%
I576
 
6.4%
O570
 
6.3%
564
 
6.3%
V564
 
6.3%
Other values (5)1158
12.8%

Classification Decision Code
Real number (ℝ≥0)

HIGH CORRELATION
MISSING
ZEROS

Distinct22
Distinct (%)1.8%
Missing2705
Missing (%)69.3%
Infinite0
Infinite (%)0.0%
Mean31.73745819
Minimum0
Maximum80
Zeros134
Zeros (%)3.4%
Negative0
Negative (%)0.0%
Memory size30.6 KiB

Quantile statistics

Minimum0
5-th percentile0
Q123
median32
Q346
95-th percentile46
Maximum80
Range80
Interquartile range (IQR)23

Descriptive statistics

Standard deviation16.01932833
Coefficient of variation (CV)0.5047451574
Kurtosis0.4355984877
Mean31.73745819
Median Absolute Deviation (MAD)11
Skewness-0.1874815949
Sum37958
Variance256.6188802
MonotonicityNot monotonic
Histogram with fixed size bins (bins=22)
ValueCountFrequency (%)
46353
 
9.0%
23185
 
4.7%
32144
 
3.7%
0134
 
3.4%
3599
 
2.5%
2967
 
1.7%
2165
 
1.7%
3031
 
0.8%
6527
 
0.7%
3320
 
0.5%
Other values (12)71
 
1.8%
(Missing)2705
69.3%
ValueCountFrequency (%)
0134
3.4%
34
 
0.1%
73
 
0.1%
91
 
< 0.1%
2012
 
0.3%
2165
 
1.7%
23185
4.7%
241
 
< 0.1%
2514
 
0.4%
263
 
0.1%
ValueCountFrequency (%)
8012
 
0.3%
744
 
0.1%
6527
 
0.7%
583
 
0.1%
551
 
< 0.1%
46353
9.0%
4013
 
0.3%
3599
 
2.5%
3320
 
0.5%
32144
3.7%

Classification Decision
Categorical

HIGH CORRELATION
MISSING

Distinct21
Distinct (%)2.0%
Missing2839
Missing (%)72.8%
Memory size30.6 KiB
SOLICITADO POR EL ASEGURADO
353 
DECISION CESCE
185 
DIMENSIÓN REDUCIDA
144 
PÉRDIDAS EN EL ÚLTIMO EJERCICIO DISPONIBLE
99 
PATRIMONIO NETO NEGATIVO
67 
Other values (16)
214 

Length

Max length54
Median length52
Mean length27.64783427
Min length14

Characters and Unicode

Total characters29362
Distinct characters30
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3 ?
Unique (%)0.3%

Sample

1st rowVOLUMEN DE RIESGO ELEVADO
2nd rowPÉRDIDAS EN EL ÚLTIMO EJERCICIO DISPONIBLE
3rd rowDECISION CESCE
4th rowANULACIÓN POR VENCIMIENTO DE LA VALIDEZ DEL SUPLEMENTO
5th rowSITUACIÓN FINANCIERA AJUSTADA

Common Values

ValueCountFrequency (%)
SOLICITADO POR EL ASEGURADO353
 
9.0%
DECISION CESCE185
 
4.7%
DIMENSIÓN REDUCIDA144
 
3.7%
PÉRDIDAS EN EL ÚLTIMO EJERCICIO DISPONIBLE99
 
2.5%
PATRIMONIO NETO NEGATIVO67
 
1.7%
EXPERIENCIA COMERCIAL NEGATIVA: INCIDENCIAS EN PAGOS65
 
1.7%
SITUACIÓN FINANCIERA AJUSTADA31
 
0.8%
ANULACIÓN POR VENCIMIENTO DE LA VALIDEZ DEL SUPLEMENTO27
 
0.7%
AUSENCIA DE DATOS FINANCIEROS DE LA SOCIEDAD20
 
0.5%
GRUPO ECONÓMICO EN SITUACIÓN DELICADA14
 
0.4%
Other values (11)57
 
1.5%
(Missing)2839
72.8%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
el452
 
11.3%
por381
 
9.5%
asegurado354
 
8.8%
solicitado353
 
8.8%
decision185
 
4.6%
cesce185
 
4.6%
en180
 
4.5%
reducida144
 
3.6%
dimensión144
 
3.6%
de103
 
2.6%
Other values (56)1519
38.0%

Most occurring characters

ValueCountFrequency (%)
I3330
11.3%
E3195
10.9%
2938
10.0%
O2619
8.9%
A2380
 
8.1%
D2028
 
6.9%
C1937
 
6.6%
S1808
 
6.2%
N1664
 
5.7%
R1419
 
4.8%
Other values (20)6044
20.6%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter26341
89.7%
Space Separator2938
 
10.0%
Other Punctuation83
 
0.3%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
I3330
12.6%
E3195
12.1%
O2619
9.9%
A2380
9.0%
D2028
7.7%
C1937
7.4%
S1808
 
6.9%
N1664
 
6.3%
R1419
 
5.4%
L1302
 
4.9%
Other values (16)4659
17.7%
Other Punctuation
ValueCountFrequency (%)
:65
78.3%
/16
 
19.3%
.2
 
2.4%
Space Separator
ValueCountFrequency (%)
2938
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin26341
89.7%
Common3021
 
10.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
I3330
12.6%
E3195
12.1%
O2619
9.9%
A2380
9.0%
D2028
7.7%
C1937
7.4%
S1808
 
6.9%
N1664
 
6.3%
R1419
 
5.4%
L1302
 
4.9%
Other values (16)4659
17.7%
Common
ValueCountFrequency (%)
2938
97.3%
:65
 
2.2%
/16
 
0.5%
.2
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII28890
98.4%
None472
 
1.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
I3330
11.5%
E3195
11.1%
2938
10.2%
O2619
9.1%
A2380
8.2%
D2028
 
7.0%
C1937
 
6.7%
S1808
 
6.3%
N1664
 
5.8%
R1419
 
4.9%
Other values (15)5572
19.3%
None
ValueCountFrequency (%)
Ó258
54.7%
É99
 
21.0%
Ú99
 
21.0%
Í13
 
2.8%
Á3
 
0.6%

Request Entry Date
Real number (ℝ≥0)

HIGH CORRELATION
MISSING

Distinct380
Distinct (%)31.8%
Missing2705
Missing (%)69.3%
Infinite0
Infinite (%)0.0%
Mean20208125.43
Minimum20200318
Maximum20221007
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size30.6 KiB

Quantile statistics

Minimum20200318
5-th percentile20200318
Q120200608
median20210219.5
Q320211123
95-th percentile20220806.75
Maximum20221007
Range20689
Interquartile range (IQR)10515

Descriptive statistics

Standard deviation7954.607502
Coefficient of variation (CV)0.0003936341117
Kurtosis-1.285469961
Mean20208125.43
Median Absolute Deviation (MAD)9511.5
Skewness0.4635236596
Sum2.416891802 × 1010
Variance63275780.52
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20200318179
 
4.6%
2020040946
 
1.2%
2020080532
 
0.8%
2020060831
 
0.8%
2020070829
 
0.7%
2020090720
 
0.5%
2020100518
 
0.5%
2020040118
 
0.5%
2021042018
 
0.5%
2020072017
 
0.4%
Other values (370)788
 
20.2%
(Missing)2705
69.3%
ValueCountFrequency (%)
20200318179
4.6%
202003271
 
< 0.1%
2020040118
 
0.5%
202004081
 
< 0.1%
2020040946
 
1.2%
202004131
 
< 0.1%
202004151
 
< 0.1%
2020050517
 
0.4%
202005062
 
0.1%
202005127
 
0.2%
ValueCountFrequency (%)
202210071
 
< 0.1%
202210031
 
< 0.1%
202209292
 
0.1%
202209281
 
< 0.1%
202209271
 
< 0.1%
202209262
 
0.1%
2022092315
0.4%
202209221
 
< 0.1%
202209202
 
0.1%
202209191
 
< 0.1%

Effective Date
Real number (ℝ≥0)

HIGH CORRELATION
MISSING

Distinct244
Distinct (%)20.4%
Missing2705
Missing (%)69.3%
Infinite0
Infinite (%)0.0%
Mean20215230.69
Minimum20200201
Maximum20221007
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size30.6 KiB

Quantile statistics

Minimum20200201
5-th percentile20200608
Q120210702
median20220228
Q320220601
95-th percentile20220901
Maximum20221007
Range20806
Interquartile range (IQR)9899

Descriptive statistics

Standard deviation6806.491913
Coefficient of variation (CV)0.0003367011744
Kurtosis-0.3580692703
Mean20215230.69
Median Absolute Deviation (MAD)573
Skewness-0.9237217074
Sum2.417741591 × 1010
Variance46328332.17
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2022060170
 
1.8%
2022070162
 
1.6%
2022080159
 
1.5%
2022030254
 
1.4%
2022040151
 
1.3%
2022050145
 
1.2%
2022030145
 
1.2%
2021020141
 
1.1%
2022010139
 
1.0%
2020021732
 
0.8%
Other values (234)698
 
17.9%
(Missing)2705
69.3%
ValueCountFrequency (%)
202002019
 
0.2%
2020021732
0.8%
202003016
 
0.2%
202003191
 
< 0.1%
202004081
 
< 0.1%
202004151
 
< 0.1%
202005051
 
< 0.1%
202005061
 
< 0.1%
202005151
 
< 0.1%
202005292
 
0.1%
ValueCountFrequency (%)
202210072
 
0.1%
202210061
 
< 0.1%
202210051
 
< 0.1%
202210042
 
0.1%
202210031
 
< 0.1%
2022100211
0.3%
202209281
 
< 0.1%
202209242
 
0.1%
202209232
 
0.1%
202209201
 
< 0.1%

Nif
Categorical

HIGH CARDINALITY
UNIFORM
UNIQUE

Distinct3901
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size30.6 KiB
U27869056
 
1
33821719N
 
1
32616175J
 
1
32705980A
 
1
32725490D
 
1
Other values (3896)
3896 

Length

Max length13
Median length9
Mean length9.004101512
Min length9

Characters and Unicode

Total characters35125
Distinct characters37
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3901 ?
Unique (%)100.0%

Sample

1st rowU27869056
2nd rowB15057474
3rd rowB15399314
4th rowB15054109
5th rowA15056849

Common Values

ValueCountFrequency (%)
U278690561
 
< 0.1%
33821719N1
 
< 0.1%
32616175J1
 
< 0.1%
32705980A1
 
< 0.1%
32725490D1
 
< 0.1%
35591049K1
 
< 0.1%
J324121571
 
< 0.1%
32718189E1
 
< 0.1%
71508307L1
 
< 0.1%
A085079151
 
< 0.1%
Other values (3891)3891
99.7%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
u278690561
 
< 0.1%
b151198371
 
< 0.1%
g150370701
 
< 0.1%
b153993141
 
< 0.1%
b150541091
 
< 0.1%
a150568491
 
< 0.1%
b150257601
 
< 0.1%
p1503700e1
 
< 0.1%
p1505500g1
 
< 0.1%
b152250551
 
< 0.1%
Other values (3891)3891
99.7%

Most occurring characters

ValueCountFrequency (%)
33793
10.8%
23660
10.4%
03462
9.9%
73361
9.6%
63079
8.8%
13076
8.8%
43032
8.6%
52923
8.3%
92365
6.7%
82338
6.7%
Other values (27)4036
11.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number31089
88.5%
Uppercase Letter4033
 
11.5%
Space Separator2
 
< 0.1%
Lowercase Letter1
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
B1320
32.7%
A302
 
7.5%
G169
 
4.2%
X143
 
3.5%
P142
 
3.5%
E140
 
3.5%
J125
 
3.1%
Q123
 
3.0%
L115
 
2.9%
F113
 
2.8%
Other values (15)1341
33.3%
Decimal Number
ValueCountFrequency (%)
33793
12.2%
23660
11.8%
03462
11.1%
73361
10.8%
63079
9.9%
13076
9.9%
43032
9.8%
52923
9.4%
92365
7.6%
82338
7.5%
Space Separator
ValueCountFrequency (%)
2
100.0%
Lowercase Letter
ValueCountFrequency (%)
b1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common31091
88.5%
Latin4034
 
11.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
B1320
32.7%
A302
 
7.5%
G169
 
4.2%
X143
 
3.5%
P142
 
3.5%
E140
 
3.5%
J125
 
3.1%
Q123
 
3.0%
L115
 
2.9%
F113
 
2.8%
Other values (16)1342
33.3%
Common
ValueCountFrequency (%)
33793
12.2%
23660
11.8%
03462
11.1%
73361
10.8%
63079
9.9%
13076
9.9%
43032
9.8%
52923
9.4%
92365
7.6%
82338
7.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII35125
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
33793
10.8%
23660
10.4%
03462
9.9%
73361
9.6%
63079
8.8%
13076
8.8%
43032
8.6%
52923
8.3%
92365
6.7%
82338
6.7%
Other values (27)4036
11.5%

ClientePadre
Real number (ℝ≥0)

HIGH CORRELATION
UNIQUE

Distinct3901
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3376.734171
Minimum8
Maximum6378
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size30.6 KiB

Quantile statistics

Minimum8
5-th percentile277
Q12169
median3461
Q34864
95-th percentile6049
Maximum6378
Range6370
Interquartile range (IQR)2695

Descriptive statistics

Standard deviation1792.120434
Coefficient of variation (CV)0.530725945
Kurtosis-0.9967939709
Mean3376.734171
Median Absolute Deviation (MAD)1359
Skewness-0.21295503
Sum13172640
Variance3211695.649
MonotonicityStrictly increasing
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
81
 
< 0.1%
44261
 
< 0.1%
43811
 
< 0.1%
43841
 
< 0.1%
43861
 
< 0.1%
43881
 
< 0.1%
43901
 
< 0.1%
43911
 
< 0.1%
43921
 
< 0.1%
43931
 
< 0.1%
Other values (3891)3891
99.7%
ValueCountFrequency (%)
81
< 0.1%
111
< 0.1%
121
< 0.1%
131
< 0.1%
141
< 0.1%
151
< 0.1%
161
< 0.1%
171
< 0.1%
181
< 0.1%
191
< 0.1%
ValueCountFrequency (%)
63781
< 0.1%
63751
< 0.1%
63671
< 0.1%
63611
< 0.1%
63591
< 0.1%
63561
< 0.1%
63541
< 0.1%
63531
< 0.1%
63521
< 0.1%
63501
< 0.1%

IDCliente
Real number (ℝ≥0)

HIGH CORRELATION
UNIQUE

Distinct3901
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean33767344.86
Minimum80002
Maximum63780002
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size30.6 KiB

Quantile statistics

Minimum80002
5-th percentile2770007
Q121690002
median34610002
Q348640002
95-th percentile60490002
Maximum63780002
Range63700000
Interquartile range (IQR)26950000

Descriptive statistics

Standard deviation17921202.89
Coefficient of variation (CV)0.5307258524
Kurtosis-0.9967941354
Mean33767344.86
Median Absolute Deviation (MAD)13590000
Skewness-0.2129548645
Sum1.317264123 × 1011
Variance3.211695129 × 1014
MonotonicityStrictly increasing
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
800021
 
< 0.1%
442600021
 
< 0.1%
438100021
 
< 0.1%
438400021
 
< 0.1%
438600021
 
< 0.1%
438800031
 
< 0.1%
439000021
 
< 0.1%
439100021
 
< 0.1%
439200021
 
< 0.1%
439300021
 
< 0.1%
Other values (3891)3891
99.7%
ValueCountFrequency (%)
800021
< 0.1%
1100071
< 0.1%
1200031
< 0.1%
1300301
< 0.1%
1400101
< 0.1%
1500031
< 0.1%
1600111
< 0.1%
1700041
< 0.1%
1800031
< 0.1%
1900021
< 0.1%
ValueCountFrequency (%)
637800021
< 0.1%
637500021
< 0.1%
636700021
< 0.1%
636100021
< 0.1%
635900021
< 0.1%
635600021
< 0.1%
635400021
< 0.1%
635300021
< 0.1%
635200021
< 0.1%
635000021
< 0.1%

VENTASCLIENTE
Real number (ℝ)

HIGH CORRELATION
ZEROS

Distinct2124
Distinct (%)54.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean536.5136132
Minimum-904.22
Maximum32216.492
Zeros52
Zeros (%)1.3%
Negative4
Negative (%)0.1%
Memory size30.6 KiB

Quantile statistics

Minimum-904.22
5-th percentile12.12
Q135.35
median91.91
Q3311.76
95-th percentile2298.9866
Maximum32216.492
Range33120.712
Interquartile range (IQR)276.41

Descriptive statistics

Standard deviation1816.182784
Coefficient of variation (CV)3.385156946
Kurtosis117.999962
Mean536.5136132
Median Absolute Deviation (MAD)71.71
Skewness9.305757657
Sum2092939.605
Variance3298519.906
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
40.4140
 
3.6%
35.35130
 
3.3%
12.1274
 
1.9%
24.2461
 
1.6%
29.2960
 
1.5%
25.2558
 
1.5%
052
 
1.3%
30.346
 
1.2%
18.1842
 
1.1%
20.241
 
1.1%
Other values (2114)3197
82.0%
ValueCountFrequency (%)
-904.221
 
< 0.1%
-545.4451
 
< 0.1%
-84.841
 
< 0.1%
-10.531
 
< 0.1%
052
1.3%
0.02021
 
< 0.1%
0.6061
 
< 0.1%
0.75751
 
< 0.1%
0.8081
 
< 0.1%
0.991
 
< 0.1%
ValueCountFrequency (%)
32216.4921
< 0.1%
31651.2271
< 0.1%
29955.6421
< 0.1%
29217.07331
< 0.1%
27892.5951
< 0.1%
21944.26051
< 0.1%
20148.2241
< 0.1%
19338.47571
< 0.1%
17788.49311
< 0.1%
15871.9561
< 0.1%

MBAvg
Real number (ℝ)

HIGH CORRELATION

Distinct2929
Distinct (%)75.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-190.0798781
Minimum-15207.6
Maximum101.356
Zeros0
Zeros (%)0.0%
Negative425
Negative (%)10.9%
Memory size30.6 KiB

Quantile statistics

Minimum-15207.6
5-th percentile-567.501
Q157.8131
median84.1748
Q392.5596
95-th percentile97.9745
Maximum101.356
Range15308.956
Interquartile range (IQR)34.7465

Descriptive statistics

Standard deviation1465.504305
Coefficient of variation (CV)-7.709939209
Kurtosis47.18214444
Mean-190.0798781
Median Absolute Deviation (MAD)10.8249
Skewness-6.750908172
Sum-741501.6045
Variance2147702.868
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
99.72846
 
1.2%
95.980117
 
0.4%
97.859116
 
0.4%
99.818714
 
0.4%
94.341514
 
0.4%
89.179313
 
0.3%
91.672613
 
0.3%
89.605413
 
0.3%
83.061412
 
0.3%
90.979511
 
0.3%
Other values (2919)3732
95.7%
ValueCountFrequency (%)
-15207.61
 
< 0.1%
-15086.21
 
< 0.1%
-15079.91
 
< 0.1%
-121863
0.1%
-12185.81
 
< 0.1%
-12185.72
 
0.1%
-11796.12
 
0.1%
-11791.76
0.2%
-11791.62
 
0.1%
-11606.51
 
< 0.1%
ValueCountFrequency (%)
101.3565
 
0.1%
99.93221
 
< 0.1%
99.9071
 
< 0.1%
99.88971
 
< 0.1%
99.818714
 
0.4%
99.79392
 
0.1%
99.77111
 
< 0.1%
99.76551
 
< 0.1%
99.72846
1.2%
99.57382
 
0.1%

MBMax
Real number (ℝ≥0)

HIGH CORRELATION
SKEWED

Distinct17
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean99.76468905
Minimum66.3043
Maximum103.256
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size30.6 KiB

Quantile statistics

Minimum66.3043
5-th percentile99.456
Q199.456
median100
Q3100
95-th percentile100.103
Maximum103.256
Range36.9517
Interquartile range (IQR)0.544

Descriptive statistics

Standard deviation0.6855645268
Coefficient of variation (CV)0.006871815401
Kurtosis1509.1123
Mean99.76468905
Median Absolute Deviation (MAD)0.103
Skewness-32.60607807
Sum389182.052
Variance0.4699987203
MonotonicityNot monotonic
Histogram with fixed size bins (bins=17)
ValueCountFrequency (%)
99.4561683
43.1%
1001628
41.7%
100.103505
 
12.9%
99.912733
 
0.8%
99.872113
 
0.3%
99.571111
 
0.3%
103.2567
 
0.2%
99.60075
 
0.1%
99.92754
 
0.1%
99.98593
 
0.1%
Other values (7)9
 
0.2%
ValueCountFrequency (%)
66.30431
 
< 0.1%
87.11461
 
< 0.1%
87.82491
 
< 0.1%
96.03511
 
< 0.1%
98.58381
 
< 0.1%
99.45362
 
0.1%
99.4561683
43.1%
99.571111
 
0.3%
99.60075
 
0.1%
99.80872
 
0.1%
ValueCountFrequency (%)
103.2567
 
0.2%
100.103505
 
12.9%
1001628
41.7%
99.98593
 
0.1%
99.92754
 
0.1%
99.912733
 
0.8%
99.872113
 
0.3%
99.80872
 
0.1%
99.60075
 
0.1%
99.571111
 
0.3%

MBMin
Real number (ℝ)

HIGH CORRELATION
ZEROS

Distinct598
Distinct (%)15.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-1861.663046
Minimum-45434.1
Maximum99.5711
Zeros170
Zeros (%)4.4%
Negative1493
Negative (%)38.3%
Memory size30.6 KiB

Quantile statistics

Minimum-45434.1
5-th percentile-9667.18
Q1-111.962
median41.583
Q379.2665
95-th percentile96.0451
Maximum99.5711
Range45533.6711
Interquartile range (IQR)191.2285

Descriptive statistics

Standard deviation7828.047807
Coefficient of variation (CV)-4.204868235
Kurtosis23.97687916
Mean-1861.663046
Median Absolute Deviation (MAD)46.9013
Skewness-4.938765407
Sum-7262347.544
Variance61278332.48
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
-111.962252
 
6.5%
0170
 
4.4%
-45434.1108
 
2.8%
88.484395
 
2.4%
-365.33588
 
2.3%
-16.577
 
2.0%
99.45673
 
1.9%
-17816.762
 
1.6%
-9667.1858
 
1.5%
-1206.0152
 
1.3%
Other values (588)2866
73.5%
ValueCountFrequency (%)
-45434.1108
2.8%
-30917.71
 
< 0.1%
-17816.762
1.6%
-14638.77
 
0.2%
-10057.14
 
0.1%
-9667.1858
1.5%
-8683.112
 
0.1%
-8412.824
 
0.1%
-4519.052
 
0.1%
-4440.641
 
< 0.1%
ValueCountFrequency (%)
99.57111
 
< 0.1%
99.45673
1.9%
99.45361
 
< 0.1%
99.37171
 
< 0.1%
99.27713
 
0.1%
99.24668
 
0.2%
99.22431
 
< 0.1%
99.14777
 
0.2%
98.933918
 
0.5%
98.75265
 
0.1%

NArticulos
Real number (ℝ≥0)

HIGH CORRELATION

Distinct571
Distinct (%)14.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean64.00517047
Minimum1
Maximum8443
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size30.6 KiB

Quantile statistics

Minimum1
5-th percentile2
Q13
median6
Q324
95-th percentile251.5
Maximum8443
Range8442
Interquartile range (IQR)21

Descriptive statistics

Standard deviation315.207344
Coefficient of variation (CV)4.924716889
Kurtosis340.8852706
Mean64.00517047
Median Absolute Deviation (MAD)4
Skewness15.82156944
Sum249684.17
Variance99355.66969
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2619
15.9%
3380
 
9.7%
4375
 
9.6%
5375
 
9.6%
6210
 
5.4%
7150
 
3.8%
9104
 
2.7%
8104
 
2.7%
1088
 
2.3%
1152
 
1.3%
Other values (561)1444
37.0%
ValueCountFrequency (%)
12
 
0.1%
2619
15.9%
3380
9.7%
4375
9.6%
5375
9.6%
6210
 
5.4%
6.251
 
< 0.1%
7150
 
3.8%
7.3751
 
< 0.1%
8104
 
2.7%
ValueCountFrequency (%)
84431
< 0.1%
8187.51
< 0.1%
7173.1251
< 0.1%
4748.6251
< 0.1%
38361
< 0.1%
34141
< 0.1%
3174.51
< 0.1%
3002.51
< 0.1%
2964.51
< 0.1%
2879.51
< 0.1%

NContratos
Real number (ℝ≥0)

HIGH CORRELATION

Distinct218
Distinct (%)5.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean21.37605742
Minimum1
Maximum1538
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size30.6 KiB

Quantile statistics

Minimum1
5-th percentile2
Q13
median6
Q313
95-th percentile84
Maximum1538
Range1537
Interquartile range (IQR)10

Descriptive statistics

Standard deviation70.44508922
Coefficient of variation (CV)3.29551366
Kurtosis186.5875972
Mean21.37605742
Median Absolute Deviation (MAD)3
Skewness11.46698484
Sum83388
Variance4962.510596
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2663
17.0%
3427
 
10.9%
4416
 
10.7%
5395
 
10.1%
6303
 
7.8%
7168
 
4.3%
9138
 
3.5%
8130
 
3.3%
10115
 
2.9%
1175
 
1.9%
Other values (208)1071
27.5%
ValueCountFrequency (%)
12
 
0.1%
2663
17.0%
3427
10.9%
4416
10.7%
5395
10.1%
6303
7.8%
7168
 
4.3%
8130
 
3.3%
9138
 
3.5%
10115
 
2.9%
ValueCountFrequency (%)
15382
0.1%
13741
< 0.1%
9881
< 0.1%
9841
< 0.1%
9471
< 0.1%
7631
< 0.1%
7501
< 0.1%
6981
< 0.1%
5761
< 0.1%
5741
< 0.1%

PrecMedArticulo
Real number (ℝ)

HIGH CORRELATION
ZEROS

Distinct2351
Distinct (%)60.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20.3535253
Minimum-24
Maximum1211
Zeros52
Zeros (%)1.3%
Negative4
Negative (%)0.1%
Memory size30.6 KiB

Quantile statistics

Minimum-24
5-th percentile2.422
Q18.8375
median14.39
Q325.702916
95-th percentile49.32
Maximum1211
Range1235
Interquartile range (IQR)16.865416

Descriptive statistics

Standard deviation30.26522307
Coefficient of variation (CV)1.486976955
Kurtosis671.801925
Mean20.3535253
Median Absolute Deviation (MAD)7.182027
Skewness19.92168487
Sum79399.1022
Variance915.9837274
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
6.0679
 
2.0%
20.278
 
2.0%
9.76333362
 
1.6%
10.159
 
1.5%
9.0959
 
1.5%
8.0853
 
1.4%
052
 
1.3%
7.0751
 
1.3%
15.1550
 
1.3%
12.62549
 
1.3%
Other values (2341)3309
84.8%
ValueCountFrequency (%)
-241
 
< 0.1%
-22.6493161
 
< 0.1%
-14.8792571
 
< 0.1%
-1.0684611
 
< 0.1%
052
1.3%
0.01011
 
< 0.1%
0.028751
 
< 0.1%
0.1041
 
< 0.1%
0.1591
 
< 0.1%
0.1595551
 
< 0.1%
ValueCountFrequency (%)
12111
< 0.1%
624.0157891
< 0.1%
371.8041
< 0.1%
331.281251
< 0.1%
319.2633331
< 0.1%
290.6822031
< 0.1%
290.41
< 0.1%
277.3733331
< 0.1%
260.061
< 0.1%
241.7476191
< 0.1%

DifCPrevCReal
Real number (ℝ)

HIGH CORRELATION
MISSING
SKEWED
ZEROS

Distinct181
Distinct (%)9.3%
Missing1951
Missing (%)50.0%
Infinite0
Infinite (%)0.0%
Mean10.49076923
Minimum-101
Maximum6908
Zeros231
Zeros (%)5.9%
Negative585
Negative (%)15.0%
Memory size30.6 KiB

Quantile statistics

Minimum-101
5-th percentile-28
Q1-3
median3
Q39
95-th percentile51
Maximum6908
Range7009
Interquartile range (IQR)12

Descriptive statistics

Standard deviation160.1156094
Coefficient of variation (CV)15.26252326
Kurtosis1769.534383
Mean10.49076923
Median Absolute Deviation (MAD)6
Skewness41.11657782
Sum20457
Variance25637.00838
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0231
 
5.9%
4113
 
2.9%
596
 
2.5%
691
 
2.3%
176
 
1.9%
375
 
1.9%
268
 
1.7%
768
 
1.7%
-158
 
1.5%
-348
 
1.2%
Other values (171)1026
26.3%
(Missing)1951
50.0%
ValueCountFrequency (%)
-1011
 
< 0.1%
-981
 
< 0.1%
-941
 
< 0.1%
-871
 
< 0.1%
-821
 
< 0.1%
-801
 
< 0.1%
-791
 
< 0.1%
-751
 
< 0.1%
-723
0.1%
-711
 
< 0.1%
ValueCountFrequency (%)
69081
< 0.1%
5771
< 0.1%
4241
< 0.1%
3661
< 0.1%
3571
< 0.1%
2431
< 0.1%
2401
< 0.1%
2102
0.1%
2031
< 0.1%
1961
< 0.1%

Interactions

Correlations

Auto

The auto setting is an easily interpretable pairwise column metric of the following mapping: vartype-vartype : method, categorical-categorical : Cramer's V, numerical-categorical : Cramer's V (using a discretized numerical column), numerical-numerical : Spearman's ρ. This configuration uses the best suitable for each pair of columns.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
The dendrogram allows you to more fully correlate variable completion, revealing trends deeper than the pairwise ones visible in the correlation heatmap.

Sample

First rows

Unnamed: 0Company NameAddressCityStatePostal CodeCredit Limit RequestedPayment Method Requested CodeCredit Limit GrantedCommercial Risk Cover ProtectedCommercial Risk Group CodePayment Method GrantedPayment Terms GrantedStatus CodeStatusClassification Decision CodeClassification DecisionRequest Entry DateEffective DateNifClientePadreIDClienteVENTASCLIENTEMBAvgMBMaxMBMinNArticulosNContratosPrecMedArticuloDifCPrevCReal
00NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNU278690568800024088.093232.83660100.0-1206.0100692.5022415.130683-3.0
11INDUSTRIAS CANDIDO HERMIDA SLO CALVARIO S/NVALDOVIEOLA CORUÑA15542.045000.099.00.00.00.00.00.02.0ANULADA58.0VOLUMEN DE RIESGO ELEVADO20200318.020200217.0B1505747411110007830.1700-3.13485100.0-816.4920121.001216.860909-39.0
22NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNB1539931412120003392.28003.70088100.0-630.395021.501920.3568427.0
33ESTRUCTURAS DE HORMIGON Y CONSTRUCCIONES ALPA SLVILLA SOLEDAD 2EL FERROLLA CORU A15404.045000.099.00.00.00.00.00.02.0ANULADA35.0PÉRDIDAS EN EL ÚLTIMO EJERCICIO DISPONIBLE20200318.020210731.0B15054109131300302402.720873.62650100.0-1206.0100373.0016511.9353988.0
44HIJOS JOSE LOSADA CANCELO SAPERBES 35EL FERROLLA CORU A15404.045000.099.045000.095.01.099.0180.066.0EN VIGOR23.0DECISION CESCE20210420.020220402.0A15056849141400102044.067531.62250100.0-816.4920202.509418.6514620.0
55CAMUYDE S.L.CTRA CEDEIRA KM ,NARONLA CORUñA15578.045000.099.00.00.00.00.00.02.0ANULADA65.0ANULACIÓN POR VENCIMIENTO DE LA VALIDEZ DEL SUPLEMENTO20201005.020220729.0B1502576015150003295.340082.80580100.017.421913.001322.7184617.0
66NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNP1503700E1616001129955.642062.08900100.0-1076.9800115.25103290.68220329.0
77NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNP1505500G17170004582.500091.34800100.057.1962155.00811.56250016.0
88GABADI SLO ESPIÑOFERROLLA CORUÑA15404.045000.099.020000.095.03.099.0180.066.0EN VIGOR30.0SITUACIÓN FINANCIERA AJUSTADA20220422.020220905.0B1522505518180003239.840092.60940100.078.114712.001219.986666-3.0
99FRANCISCO MATA SASAN PEDRO VIESNA LOURE 1LA CORU ALA CORU A15000.045000.099.045000.095.03.099.0180.066.0EN VIGOR23.0DECISION CESCE20200720.020220702.0A1503410119190002388.830025.93890100.0-469.717024.25939.475555-3.0

Last rows

Unnamed: 0Company NameAddressCityStatePostal CodeCredit Limit RequestedPayment Method Requested CodeCredit Limit GrantedCommercial Risk Cover ProtectedCommercial Risk Group CodePayment Method GrantedPayment Terms GrantedStatus CodeStatusClassification Decision CodeClassification DecisionRequest Entry DateEffective DateNifClientePadreIDClienteVENTASCLIENTEMBAvgMBMaxMBMinNArticulosNContratosPrecMedArticuloDifCPrevCReal
38913891GRABADOS Y CRISTALERIA MONCHO SLIND. GRELAS-BENS . CL GAMBRINUS , 00A CORUNALA CORUñA15008.045000.099.045000.095.05.099.0180.066.0EN VIGOR35.0PÉRDIDAS EN EL ÚLTIMO EJERCICIO DISPONIBLE20220927.020220801.0B15162357635063500002246.54-135.5050100.000-1076.98005.0549.3080000.0
38923892NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN32678967S63526352000244.44-84.308399.456-352.38103.0314.813333NaN
38933893NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN33994526C63536353000228.2892.7194100.00079.26654.047.070000NaN
38943894CONSTRUGENIA PROYECTOS Y OBRAS S.LC/MEJICO Nº10-2ºIZQ.A CORUÑALA CORUñA15009.045000.099.020000.095.04.099.0180.066.0EN VIGOR32.0DIMENSIÓN REDUCIDA20220929.020220801.0B7015776363546354000224.2495.1112100.00087.38088.083.030000NaN
38953895NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN32669675S63566356000227.2796.469799.45691.67053.039.090000NaN
38963896AGEITOS REGO SERVICIOS DE MANTENIMIENTO SL.EMILIA PARDO BAZAN, 5 - ENTRIBEIRALA CORUñA15960.045000.099.015000.095.05.099.0180.066.0EN VIGOR32.0DIMENSIÓN REDUCIDA20220928.020220801.0B7053011863596359000210.1033.152099.4560.00003.033.366666NaN
38973897NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN32716859A63616361000229.2993.606799.45688.48433.039.763333NaN
38983898CONTRATAS Y VENTAS SAUCL ASTURIAS 41OVIEDOASTURIAS33004.045000.099.045000.095.00.099.0180.066.0EN VIGOR0.0NaN20220929.020220801.0A3301421863676367000228.0062.310499.456-16.50008.074.000000NaN
38993899NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN32948867X63756375000219.1996.930199.45692.15493.036.396666NaN
39003900NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN32719041T63786378000258.58-3941.130099.456-9667.18004.0414.645000NaN